AMENDMENTS TO THE CLAIMS 

The following listing of claims will replace all prior versions, and listings, of claims 
in the application. 




1. \ (Currently Amended) A method of performing a two-dimensional discrete cosine 
transfoun (DCT) using a microprocessor having an instruction set that includes single- 
instructio\ multiple-data (SIMP) SIMD floating point instructions, wherein the method 
comprises: 

receiving a block of integer data having C columns and R rows, wherein each of the 
R raws contains C row data values, wherein the block of integer data is 
indicative of a portion of an image; and 

for each row, 

loading the Cstow data values of the row into registers; 
converting the>C row data values into floating point form, wherein the 

registers eadi hold two floating point row data values; and 
performing a plurality of weighted-rotation operations on the values in the 

registers, wherem the weighted-rotation operations are performed 

using SIMD floating point instructions. 




2. (Currently Amended) The method of claiitk 1, wherein said converting is accomplished 
using th e _pi2f\v a packed integer word to floatingpoint conversion (pi2fiv) instruction. 



3. (Currently Amended) The method of claim 1, wherein said weighted-rotation operations 
are accomplished using th e pswap, pfinul, and pfpriacc instructions a packed swap 
doubleword (pswapd) instruction, a packed floating-point multipUcation (pfinul) instruction 
and a packed floating-point negative accumulate (pfpnacc) insft}iction . 

4. (Original) The method o^'c^l^^ 1, further comprising: 
for each row, 

altering the arr^g^ent of values in the registers; 
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performing a second plurality of weighted-rotation operations on the values in 
agisters; 

again altering tn^'^aq^gement^the values in the registers; 
performing a third pluralitj^Tw^ghted-rotation operations on the values in 
the registers; 

yet again altering the arratilgement of the"v^4^s in the registers; and 
performing a fourth plurality of weighted-rotatiwKQperations on the values in 
the registers to obtain intermediate floating poiinVaMes. 

5. (Qriginal) The method of claim 4, further comprising: 
(or each row, 

storing the intermediate floating point values to an intermediate buffer. 

6. (Original) The method of claim 5, further comprising: 

for two columnSs^t a time, 

loading damvfrom two columns of intermediate data into each of a plurality of 
registers^ 

performing a plur^l^of weighted-rotation operations on the values in the 
registers, wherei^he weighted-rotation operations for two columns 
are performed in parallel using SIMD floating point instructions. 

7. (Currently Amended) The method of claim 6>swherein said weighted-rotation operations 
for two columns at a time are accomphshed usin^ pfinul, pfsub, and pfadd instructions a 
packed floating-point multiplication (pfmul) instruction\a packed floating-point subtraction 
(pfsub) instruction and a packed floating-point addition (pfahd) instr uction 



8. (Original) The method of claim 6, further comprising: 
for two columns at a time, 

as each weighted-rotation operation is done, storing 
operation results to the intermediate buffer. 



/eighted-rotation 
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9. (Original) The method of claim 8, further comprising: 

\ for two columns at a time, 
\ retrieving weighted-rotation operation results from the intermediate buffer; 
\ performing a second plurality of weighted-rotation operations on the retrieved 
\ values; 

Bgain storing weighted-rotation operation results to the intermediate buffer as 

\ the weighted-rotation operations of the second plurality are done; 
agairi retrieving weighted-rotation operation results from the intermediate 
Nbuffer; 

performiW a third plurality of weighted-rotation operations on the retrieved 
values; 

yet again storing weighted-rotation operation results to the intermediate buffer 
as the w^ghted-rotation operations of the third plurality are done; 

yet again retrieving weighted-rotation operation results from the intermediate 
buffer; \ 

performing a fourth pmrality of weighted-rotation operations on the retrieved 
values; \ 

converting the weighted-rofeation operation results from the fourth plurality to 
integer results. \ 

10. (Original) The method of claim 9, fiirther conmrising: 

for two columns at a time, writing the integV results to an output buffer. 

11. (Currently Amended) A method of performing a discrete cosine transform (DCT) using a 
microprocessor having an instruction set that includes single-instruction multiple-data 
(SIMP) SIMD floating point instructions, wherein the method comprises: 

receiving a block of integer data having C columns andSR rows; and 
for two columns at a time, \ 

loading column data into registers; \ 

converting the column data into floating point form, wherein the registers each 
hold a floating point column data value from two oolumns; and 
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^performing a plurality of weighted-rotation operations on the values in the 
registers, wherein the weighted-rotation operations for two columns 
^rformed in parallel using SIMD floating point instructions. 



12. (Currently Amended) The metho^'^claim 11, wherein said weighted-rotation operations 
are accomplished using pfinul, pfsub, ahA pfadd instructions a packed floating-point 
multiplication (pfinul) instruction, a packed floatingpoint subtraction (pfsub) instruction and 
a packed floating-point addition (pfadd) instruction . 

13. (Original) The method olf^daiijff 11, further comprising: 

for two columns at a time 

as each wei^Bwifer6fation operation is done, storing weighted-rotation 
operatioif results to^ intermediate buffer. 



14v(Driginal) The method of claim 13, further comprising: 
rbr two columns at a time, 

\retrieving weighted-rotation operation results from the intermediate buffer; 
pe^rming a second plurality of weighted-rotation operations on the retrieved 
^\(alues; 

again storihg weighted-rotation operation results to the intermediate buffer as 
the wei^^ed-rotation operations of the second plurality are done; 

again retrieving wfeighted-rotation operation results from the intermediate 
buffer; \. 

performing a third pluralityvqf weighted-rotation operations on the retrieved 

values; \w 
yet again storing weighted-rotation of^ration results to the intermediate buffer 

as the weighted-rotation operatiohs of the third plurality are done; 
yet again retrieving weighted-rotation operation results from the intermediate 

buffer; \^ 
performing a fourth plurality of weighted-rotation opsrations on the retrieved 

values; 
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^converting the weighted-rotation operation results from the fourth pluraUty to 
integer results. 

15. (Original) The i^thod of claim 14, further comprising: 

for two columns at a time, writing the integer results to an output buffer. 

16. (Currently Amended) A computer system comprising: 

a processor having Vn instruction set that includes single-instruction multiple-data 
(SEVLD) S^4D^oating point instructions; and 

a memory coupled to thaprocessor, wherein the memory stores software instructions 
executable by the processor to implement [the method of] a two-dimensional 
discrete cosine transform method, the method comprising: receiving a block 
of integer data having\C columns and R rows, wherein each of the R rows 
contains C row data values, wherein the block of integer data is indicative of a 
portion of an image; and 

for each row, 

loading the C row data values into registers of the processor ; 
converting the C row data values into floating point form, wherein the 

registers each hold two flowing point row data values; and 
performing a plurality of weightea\rotation operations on the values in the 

registers, wherein the weighted-rotation operations are performed 

using SIMD floating point instruVtions. 

17. (Currently Amended) A carrier medium comprising sonware instructions executable by a 
microprocessor having an instruction set that includes single-instruction multiple-data 
(SIMP) SIMD floating point instructions to implement a method of performing a two- 
dimensional discrete cosine transform (DCT), wherein the methodvcomprises: 

receiving a block of integer data having C columns and R rbws, wherein each of the 
R rows contains C row data values, wherein the bl\ck of integer data is 
indicative of a portion of an image; and 

for each row, 
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loading the C row data values into registers; 
)nverting the C row data values into floating point form, wherein the 
registers each hold two floating point row data values; and 
perfo^ing a plurality of weighted-rotation operations on the values in the 
tegisters, wherein the weighted-rotation operations are performed 
using SIMD floating point instructions. 

1 8. (New) A computer system comprising: 

a processor having anv instruction set that includes single-instruction multiple-data 

(SIMD) floating point instructions; and 
a memory coupled to the OTocessor, wherein the memory stores software instructions 

executable by the processor to implement the method of receiving a block of 

integer data having C columns and R rows, wherein the block of integer data 

is indicative of a portion\f an image; and 
for two columns at a time, ^ 

loading column data into regi^ers of the processor; 

converting the column data inta floating point form, wherein the registers each 
hold a floating point column data value from two columns; and 

performing a plurality of weighted-rotation operations on the values in the 
registers, wherein the wei^^d-rotation operations for two columns 
are performed in parallel using SIMD floating point instructions. 

19. (New) A carrier medium comprising software instructions executable by a 
microprocessor having an instruction set that includes^ single-instruction multiple-data 
(SIMD) floating point instructions to implement a method c^f performing a discrete cosine 
transform (DCT), wherein the method comprises: 

receiving a block of integer data having C columns and R\ows; and 
for two columns at a time, 

loading column data into registers; 

converting the column data into floating point form, wherein the registers each 
hold a floating point column data value from two columns; and 




niijg a plurality of weighted-rotation operations on the values in the 
registersT^Rltereiij^e weighted-rotation operations for two columns 
are performed in parallelusing1SOE4B4lQating point instructions. 
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